Back

The Journal of Molecular Diagnostics

Elsevier BV

Preprints posted in the last 30 days, ranked by how well they match The Journal of Molecular Diagnostics's content profile, based on 36 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.

1
Evaluation of somatic variant calling methods on high coverage tumour-only amplicon sequencing data in a clinical environment

Bharne, D.; Gaston, D.

2026-04-11 bioinformatics 10.64898/2026.04.08.717310 medRxiv
Top 0.1%
7.4%
Show abstract

One of the current workhorses of next-generation sequencing in clinical molecular diagnostics laboratories for profiling somatic mutations in tumours are amplicon-based targeted sequencing panels. Many open-source somatic variant callers are available; however, their use in clinical applications remains under explored. Therefore, we integrated outputs of six variant callers (FreeBayes, MuTect2, Pisces, Platypus, VarDict and VarScan) into a Snakemake pipeline and evaluated tumour-only data from the HD789 commercial reference standard sequenced in triplicate on three different sequencing runs using the Illumina AmpliSeq Focus panel on MiSeq and NextSeq 2000. A 1:4 dilution sample was sequenced for evaluating limits of variant detection. The called variants were analysed along depth, allele frequency, and other sequencing metrics. The variant callers were evaluated by their level of concordance and performance on known somatic variants. FreeBayes consistently called the largest number of somatic variants in each sample but also included more potential artifacts. Overall, FreeBayes, VarScan, MuTect2, and Pisces had the best performance on HD789 data.

2
Nanopore Whole-Genome Sequencing for Rapid, Comprehensive Molecular Diagnostics of Brain Tumors in Adult Patients

Halldorsson, S.; Nagymihaly, R. M.; Bope, C. D.; Lund-Iversen, M.; Niehusmann, P.; Lien-Dahl, T.; Pahnke, J.; Bruning, T.; Kongelf, G.; Patel, A.; Sahm, F.; Euskirchen, P.; Leske, H.; Vik-Mo, E. O.

2026-04-24 pathology 10.64898/2026.04.23.26351563 medRxiv
Top 0.1%
6.4%
Show abstract

Background: Classification of central nervous system (CNS) tumors has become increasingly complex, raising concerns about the sustainability of comprehensive molecular diagnostics. We have evaluated nanopore whole genome sequencing (nWGS) as a single workflow to replace multiple diagnostic assays. Methods: We performed nWGS on DNA extracted from 90 adult CNS tumor samples (58 retrospective, 32 prospective) and compared the results to findings from standard of care (SoC) diagnostic work-up. Analysis was done through an automated workflow that consolidated diagnostically and therapeutically relevant genomic alterations, including copy-number variation, structural, and single-nucleotide variants, chromosomal aberrations, gene fusions, and methylation-based classification. Results: nWGS supported final diagnostic classification in all samples with >15% tumor cell content, requiring ~3 hours of hands-on library preparation, parallel sample processing, and sequencing times within 72 hours. Methylation-based classification was available within 1 hour and was concordant with the integrated final diagnosis in 89% of cases (80/90). All diagnostically relevant copy-number variations, single-nucleotide variants, and gene fusions were concordant with SoC testing. MGMT promoter methylation status matched in 94% of cases. In addition, nWGS identified prognostic and potentially actionable variants that were not reported or covered by SoC. Conclusions: nWGS delivers comprehensive genetic and epigenetic results with a fast turn-around compared to standard methods. This enables efficient, accurate, and scalable molecular diagnostics of CNS tumors using a single platform. This data supports its implementation in routine clinical practice and may be extended to other cancer types requiring complex genomic profiling.

3
Tm guided exon exon junction RT-PCR enables specific detection of RNA variants lacking easily distinguishable exonic regions

Ahn, J.; Zack, D.; Zhang, P.

2026-04-05 molecular biology 10.64898/2026.04.02.716213 medRxiv
Top 0.1%
6.4%
Show abstract

Accurate detection of RNA splice variants is often hindered when transcripts lack large distinguishable exonic regions, making conventional PCR strategies challenging. We developed a simple melting temperature (Tm)-guided exon-exon junction (EEJ) RT-PCR method to enable variant-specific detection under these conditions. Uni-directional primers spanning exon-exon junctions were designed so that approximately each half anneals to adjacent exons. The Tm of each half-site was set >7{degrees}C below the annealing temperature, preventing stable binding to individual exons and enforcing junction-dependent amplification. The method was evaluated using HTRA1-AS1 long noncoding RNA variants that share overlapping exon sequences but differ in splice connectivity. HTRA1-AS1 comprises five variants, only one with a large distinguishable exon. Tm-guided EEJ primers robustly discriminated the remaining four variants. After optimization, amplification yielded sharp, single bands with minimal cross-reactivity. Compared with conventional designs, this approach reduced heteroduplex and heteroquadruplex formation, improving band clarity. Sanger sequencing confirmed junction specificity, and the method performed well in multiplex settings. Overall, Tm-guided EEJ RT-PCR is a cost-effective, high-resolution approach for detecting RNA variants lacking easily distinguishable exonic regions, readily compatible with standard RT-PCR and qPCR workflows.

4
Whole-genome pre-amplification as a viable approach for genomic screening of FFPE-derived DNA samples

Guerrero Quiles, C.; Lodhi, T.; Sellers, R.; Sahoo, S.; Weightman, J.; Breitwieser, W.; Sanchez Martinez, D.; Bartak, M.; Shamim, A.; Lyons, S.; Reeves, K.; Reed, R.; Hoskin, P.; West, C.; Forker, L.; Smith, T.; Bristow, R.; Wedge, D. C.; Choudhury, A.; Biolatti, L. V.

2026-03-29 molecular biology 10.64898/2026.03.26.714414 medRxiv
Top 0.1%
4.1%
Show abstract

Whole-genome sequencing (WGS) enables comprehensive analysis of tumour genomes, but its use in formalin-fixed paraffin-embedded (FFPE) samples is limited by DNA fragmentation and low yields. Whole-genome amplification (WGA) methods such as multiple displacement amplification (MDA) can boost DNA availability but distort copy-number alteration (CNA) profiles. DNA ligation-mediated MDA (DLMDA) mitigates this bias by reconstituting fragmented templates, yet its performance in FFPE-derived DNA remains uncertain. We compared paired DLMDA pre-amplified (2h, 8h) and non-pre-amplified FFPE prostate tumour samples from 22 archival blocks (5, 15 and 20 years old). DLMDA increased DNA yield by 42- to 86-fold, with global CNA patterns largely preserved. However, DLMDA significantly reduced the number of detected CNA deletions and amplifications. These effects were independent of both block age and reaction time. CNA dropouts were randomly distributed across the genome, indicating that DLMDA does not introduce regional bias. Our results show that DLMDA enables robust DNA yield recovery and avoids false-positive CNA artefacts, but at the cost of reduced CNA sensitivity. While suitable for CNA screening pipelines through WGS, further improvements are required to minimise the false-negative risk and improve the techniques sensitivity for FFPE-based genomics.

5
Determinants of DNA-sequence-based Diagnostic Yield in the CSER Consortium

Mavura, Y.; Crosslin, D.; Ferar, K. D.; Lawlor, J. M.; Greally, J. M.; Hindorff, L.; Jarvik, G. P.; Kalla, S.; Koenig, B. A.; Kvale, M.; Kwok, P.-Y.; Norton, M.; Plon, S. E.; Powell, B. C.; Slavotinek, A.; Thompson, M. L.; Popejoy, A. B.; Kenny, E. E.; Risch, N.

2026-04-22 genetic and genomic medicine 10.64898/2026.04.20.26351140 medRxiv
Top 0.1%
4.0%
Show abstract

PurposeDiagnostic yield from exome and genome sequencing varies widely across studies. It remains unclear how much of this variation reflects patient-level factors (e.g., sex, clinical features, race/ethnicity, genetic ancestry) versus site-level practices such as sequencing modality or variant interpretation workflows. We aimed to quantify the contributions of these factors to diagnostic outcomes across five U.S. clinical sequencing sites. MethodsWe performed a cross-sectional analysis of 3,008 prenatal, neonatal, and pediatric cases from the NHGRI Clinical Sequencing Evidence-Generating Research (CSER) consortium (2017-2023). Clinical indications spanned neurodevelopmental, neurological, immunological, metabolic, craniofacial, skeletal, cardiac, prenatal, and oncologic presentations. Genetic ancestry was inferred from sequencing data, and variants were interpreted using ACMG/AMP guidelines to classify DNA-based diagnoses. Generalized linear mixed models were used to estimate associations between diagnostic yield and fixed effects (sex, prenatal status, isolated cancer, number of clinical indications, sequencing modality, race/ethnicity, and genetic ancestry), while modeling study site as a random effect to quantify between-site variation. ResultsThe overall diagnostic yield was 19.0%. Multiple clinical indications (OR=1.47, 95% CI 1.20-1.80, p<0.001) were associated with higher diagnostic yield, and male sex (OR=0.80, 95% CI 0.66-0.96, p=0.017) and prenatal status (OR=0.63, 95% CI 0.44-0.90, p=0.012) were associated with lower yield. Sequencing modality, race/ethnicity, genetic ancestry, and isolated cancer were not statistically significantly associated with diagnostic outcomes.. A model without fixed effects attributed [~]10% of variance in diagnostic yield to between-site differences. After adjusting for covariates, site-level variance decreased to 5.7%, indicating consistent variation across sites not explained by measured patient factors. ConclusionAcross five sites, patient-level clinical features influenced diagnostic yield, but substantial site-level variation remained even after adjustment. Differences in variant interpretation, or case-classification practices may contribute to this residual variability. Further efforts to increase consistency in exome- and genome-sequencing diagnostic workflows may help reduce inter-site differences.

6
A high-throughput Epstein-Barr virus nuclear antigen 1 (EBNA1) serology test strip for nasopharyngeal carcinoma risk screening

Warner, B. E.; Patel, J.; Satterwhite, R.; Wang, R.; Adams-Haduch, J.; Koh, W.-P.; Yuan, J.-M.; Shair, K. H. Y.

2026-04-13 infectious diseases 10.64898/2026.04.08.26350329 medRxiv
Top 0.1%
3.6%
Show abstract

PurposeAntibodies to Epstein-Barr virus (EBV) proteins can predict nasopharyngeal carcinoma (NPC) risk. We previously defined a prototype EBNA1 protein panel and multiplex immunoblot assay that distinguishes NPC risk several years pre-diagnosis. Assay throughput and specificity are critical to effectively implement a population-level screening program. Here, we developed a strip test assay - EBNA1 SeroStrip-HT - with an objective to increase throughput and maximize specificity. Experimental DesignEBNA1 full-length (FL) and glycine-alanine repeat deletion mutants (dGAr) were purified from insect and mammalian cells to screen serum IgA/IgG from prospective cohorts in Singapore and Shanghai, China, with known time intervals to NPC diagnosis. Twenty pre-diagnostic sera within 4 years to diagnosis were compared to 96 healthy controls using a nested case-control study design. ResultsIgA to mammalian-derived EBNA1 dGAr achieved 85.0% sensitivity and 94.8% specificity (AUC, 0.939) for NPC status. IgA to insect-derived EBNA1 dGAr showed the same sensitivity (85.0%) and similar specificity (93.8%) (AUC, 0.941). IgA to insect-derived EBNA1 FL had a higher 90% sensitivity, but lower 91.7% specificity (AUC, 0.940). Combining EBNA1 FL and dGAr results showed that subjects positive for both proteins had a 243.67 odds ratio for NPC incidence compared to double-negative scores. ConclusionThis study demonstrated the efficacy of EBNA1 SeroStrip-HT for NPC risk assessment and stratification in high- and intermediate-risk populations, yielding high accuracy and a 12-fold increased throughput over the prototype. The insect system was appropriate for large-scale production of purified EBNA1. Larger, geographically diverse cohorts are warranted to confirm these results, especially in low-incidence populations.

7
GPR143, a novel immunohistochemical marker for renal tumors with FLCN/TSC/MTOR-TFE alterations

Li, Q.; Singh, A.; Hu, R.; Huang, W.; Shapiro, D. D.; Abel, E. J.; Zong, Y.

2026-04-13 pathology 10.64898/2026.04.06.26350070 medRxiv
Top 0.1%
2.8%
Show abstract

Although several ancillary tests are available in limited laboratories, diagnosis of microphthalmia (MiT)/TFE family translocation renal cell carcinoma (tRCC) could be challenging due to diverse and overlapping tumor morphology and the lack of reliable biomarkers. GPNMB has been recently identified as a diagnostic marker for various renal neoplasms with FLCN/TSC/mTOR-TFE alterations. However, the sensitivity and specificity of GPNMB immunostain are suboptimal and the result interpretation in ambiguous cases could be difficult. To search additional biomarkers that could improve the screening sensitivity and predict genetic aberrations in FLCN/TSC/mTOR-TFE pathway in renal tumors, we performed bioinformatic analysis of publicly available cancer databases and found GPR143, a transmembrane protein regulated by MiT transcription factors, was highly expressed in a subset of renal cell carcinomas (RCCs). In two the Cancer Genome Atlas (TCGA) kidney cancer cohorts, RCCs with high levels of GPR143 expression were enriched for renal neoplasms with FLCN/TSC/mTOR-TFE alterations. Similar to GPNMB labeling, GPR143 immunostain was positive in the majority of tRCC cases and renal tumors with FLCN/TSC/mTOR alterations, suggesting that GPR143 could function as another surrogate marker for FLCN/TSC/mTOR-TFE alterations in certain renal tumors. Interestingly, despite the concordant GPR143 and GPNMB immunoreactivity in most renal neoplasms with FLCN/TSC/mTOR-TFE alterations, diffuse GPR143 immunostain was observed in some cases with negative or focal GPNMB labeling. Taken together, our results indicate GPR143 could serve as a useful adjunct marker to improve the sensitivity for screening renal tumors with FLCN/TSC/mTOR-TFE alterations.

8
Clinicopathological Factors Associated with Gastric Signet Ring Cell Carcinoma in CDH1 Pathogenic Variant Carriers: Report from the GASTRIC Consortium

Gilad, O.; Drogan, C. M.; Keel, E.; Gao, G.; Swallow, C.; Govindarajan, A.; Brar, S.; Heller, M.; Apostolico, T.; Jacobs, M. F.; Gofar, K.; Dudley, B.; Karloski, E.; Lombardi, C.; Springer, M.; Saha, S.; Cox, D.; Lerner, B. A.; Hanna, G.; Chertock, Y.; Khan, A.; Ertan, S.; Hilfrank, K.; Rustgi, S. D.; Singh, A.; Hall, M. J.; Llor, X.; Bansal, A.; Patel, S. G.; Brand, R. E.; Roberts, M. E.; Stanich, P. P.; Stoffel, E.; Katona, B. W.; Aronson, M.; Kupfer, S. S.

2026-04-02 gastroenterology 10.64898/2026.03.27.26349321 medRxiv
Top 0.1%
2.7%
Show abstract

Background: Gastric cancer surveillance in CDH1 pathogenic variant carriers is challenging, as predictors of localized (stage T1a) and advanced (stage >T1a) signet ring cell carcinoma (SRCC) are not well defined. We established the Group of investigAtors STriving toward Research In CDH1 (GASTRIC) consortium to identify clinicopathological factors associated with localized and advanced SRCC. Methods: A retrospective observational study (1998-2025) of CDH1 carriers across twelve academic centers was performed. Clinical, endoscopic, and pathological data were compared between carriers with and without SRCC on endoscopy, and between those with advanced versus localized or no cancer on gastrectomy specimens. Results: Overall, 390 CDH1 carriers from 235 families were included. Presence of SRCCs on endoscopy was significantly associated with thickened folds, nodularity, masses, and intestinal metaplasia, while gastritis was negatively associated. Of 196 carriers (52.4%) undergoing gastrectomy, 11 (5.6%) had advanced cancers, 10(90.9%) of which showed endoscopic abnormalities. Identification of SRCC on baseline endoscopy was the most sensitive feature for advanced disease (0.81) but had moderate specificity (0.74), whereas masses and thickened folds were highly specific (0.99 and 0.96, respectively) but less sensitive. Negative predictive values were high (0.94-1.0), while positive predictive values were modest (0.13-0.66). On multivariate analysis, masses and SRCC foci on baseline endoscopy were independent predictors of advanced disease. Conclusion: Among CDH1 carriers, absence of endoscopic findings was reassuring, whereas significance of detected endoscopic and pathological abnormalities was less certain. Advanced cancer occurred in a small number of carriers, with endoscopic abnormalities in nearly all cases. Endoscopic surveillance might be an alternative to surgery in carriers without worrisome mucosal findings.

9
Evaluating the CellSearch CMMC Assay for Non-Invasive Longitudinal MRD Monitoring

Powell, S.; Bui, T.; Gullipalli, D.; LaCava, M.; Jones, S. M.; Hansen, T.; Kuhr, F.; Swat, W.; Simandi, Z.

2026-04-02 hematology 10.64898/2026.03.28.26349025 medRxiv
Top 0.1%
2.5%
Show abstract

Current clinical management of multiple myeloma (MM) relies on bone marrow (BM) biopsies for minimal residual disease (MRD) assessment. While BM biopsies are the gold standard, their invasive nature and potential to miss extramedullary or patchy disease necessitate sensitive, non-invasive liquid biopsy platforms. In this study, we evaluated the analytical performance of the CellSearch CMMC assay to determine its utility for deep-MRD monitoring. Using a standard 4 mL whole blood input, the assay achieves a WBC-normalized sensitivity of 2.45 x 10-7, supported by a limit of quantitation of 5 cells per run. Given this high analytical sensitivity, the assay provides a robust negative predictive value, rendering false-negative findings highly unlikely in populations with detectable peripheral disease. These findings characterize the CellSearch CMMC assay as a highly sensitive, analytically validated platform for non-invasive deep-MRD level longitudinal surveillance monitoring. When integrated into a clinical workflow that accounts for its specificity profile, the platform offers a patient-friendly complement to serial BM biopsies, with the potential to reduce their frequency in appropriate clinical contexts.

10
Quantifying PD1 saturation by PDL1 in tumor tissue using a novel RNA aptamer-based assay

Veeramani, S.; Yin, C.; Yu, N.; Coleman, K. L.; Smith, B. J.; Weiner, G. J.

2026-04-08 immunology 10.64898/2026.04.06.716702 medRxiv
Top 0.1%
2.2%
Show abstract

BackgroundTherapeutic agents targeting the PD1-PDL1 interaction are of great clinical value, however accurately predicting which patients are most likely to benefit is challenging. Improved predictive biomarkers for anti-PD1 therapy are clearly needed. Quantifying PD1 saturation by PDL1 in tumor tissue has the potential to serve as such a biomarker. Here we report a novel bioassay called the PD1 Ligand Receptor Complex Aptamer (LIRECAP) assay and demonstrate it can be used to quantify the saturation of PD1 by PDL1 in formalin-fixed paraffin-embedded tumor biospecimens. ResultsThe PD1 LIRECAP assay was developed by identifying a pair of RNA aptamers. One aptamer preferentially binds to unoccupied PD1 (P aptamer) and the other to the PD1-PDL1 complex (C aptamer). P and C aptamers were added together to a formalin-fixed sample, and bound aptamer extracted. A 2-color qRT-PCR assay using a single set of primers was used to determine the ratio of the sample-bound C to P aptamers (C:P ratio) which reflected PD1 saturation by PDL1 in the sample. Quantification of PD1 saturation by PDL1 as determined by the PD1 LIRECAP assay correlated closely with PD1-mediated signaling and PD1-PDL1 proximity. Analysis of sarcoma FFPE biospecimens confirmed the assay is technically reproducible on clinical biospecimens. There were significant differences in PD1 saturation by PDL1 between patients as well as considerable intratumoral heterogeneity. ConclusionsThe PD1 LIRECAP assay is novel assay that can be used to quantify PD1 saturation by PDL1 in clinical biospecimens. The assay is technically feasible, reproducible, and has the potential to serve as a superior predictive biomarker for PD1/PDL1-based therapy. Similar assays based on this platform could be used in other systems and settings to quantify interaction between two molecules.

11
Diagnostic Classification for Long Covid Patients identifying Persistent Virus and Hyperimmune Pathophysiologies

James-Pemberton, P.; Harper, D.; Wagerfield, P.; Watson, C.; Hervada, L.; Kohli, S.; Alder, S.; Shaw, A.

2026-04-22 infectious diseases 10.64898/2026.04.21.26351402 medRxiv
Top 0.2%
1.8%
Show abstract

A multiplex diagnostic test is evaluated for self-reported long COVID associated persistent symptoms and a poor recovery from a SARS-CoV-2 infection. A mass-standardised concentration of total antibodies (AC), high-quality (HQ) antibodies and percentage of HQ antibodies (HQ%) is assessed against a spectrum of spike proteins to the SARS-CoV-2 variants: Wuhan, , {delta}, and the Omicron variants BA.1, BA.2, BA.2.12.1, BA.2.75, BA.5, CH.1.1, BQ.1.1 and XBB.1.5 in three cohorts. A cohort of control patients (n = 46) recovered (CC) and a cohort of self-declared long COVID patients (n = 113) (LCC). A nested Receiver Operating Characteristic (ROC) analysis, performed for the variant with lowest HQ concentration in the spectrum, produced an area under the curve and AUC = 0.61 (0.53-0.70) for the CC vs LCC cohorts. For the LCC cohort, the cut-off thresholds for AC = 0.8 mg/L, HQ = 1.5 mg/L and HQ% of 34% were determined, leading to a 71% sensitivity and 66% specificity derived by the Youden metric. The cohorts may be fully classified based on ROC and outlier analysis to give an incidence of persistent virus 62% (95% CI 52% - 71%), hyperimmune 12% (95% CI 7% - 20%) and unclassified, 26% (95% CI 18% - 35%). The overall diagnostic accuracy for both the hyper and hypo immune is 69%. All clinical interventions can now be tailored for the heterogenous long COVID patient cohort.

12
DNA methylation signatures of mismatch repair-deficient colorectal cancer

Ward, R.; Endicott, M.; Mallabar-Rimmer, B.; Burrage, J.; Sherwood, K.; Huang, Q.; Ward, J. C.; Thorn, S.; Woolley, C.; Wood, S.; Dempster, E.; Green, H. D.; Tomlinson, I.; Webster, A. P.

2026-04-13 cancer biology 10.64898/2026.04.09.717165 medRxiv
Top 0.3%
1.3%
Show abstract

BackgroundColorectal cancer (CRC) is a molecularly heterogeneous disease shaped by both genetic and epigenetic alterations. Approximately 15% of CRCs display widespread CpG island hypermethylation, known as the CpG Island Methylator Phenotype (CIMP). CIMP-high (CIMP-H) tumours frequently exhibit MLH1 promoter hypermethylation, leading to mismatch repair deficiency (MMRd) and microsatellite instability (MSI). However, DNA methylation patterns associated with MSI, independent of CIMP and MLH1 silencing, and the influence of clinical variables such as anatomical location and patient age on the CRC methylome remain poorly characterised. MethodsWe performed epigenome-wide DNA methylation profiling of 259 primary CRC tissue samples using the Illumina EPICv2 array, comparing differential methylation between MSI and microsatellite stable (MSS) CRC, adjusting for tumour purity, MLH1 promoter methylation, CIMP status, and anatomical location, to account for known confounders. We further evaluated the independent effects of anatomical location and patient age on global methylation patterns. ResultsEpigenome-wide differential methylation between MSS and MSI CRC was dominated by MLH1 promoter hypermethylation. After adjusting for MLH1 hypermethylation and CIMP status, we identified a distinct set of 656 CpG sites associated with MMRd independent of MLH1 silencing. These included hypermethylation at LRP6, GSK3{beta}, and CDK12, implicating altered WNT signalling and transcriptional regulation pathways. Comparison of MSI subgroups revealed the co-occurrence of MLH1 hypermethylation with promoter hypermethylation at TXNRD1. Anatomical location showed a strong independent effect on methylation patterns, while we observed only modest effects of patient age on the CRC methylome after adjustment for confounders. ConclusionsWe identified a distinct methylation profile distinguishing MSS and MSI CRC, including MLH1-independent markers of MMRd, as well as novel differentially methylated loci within MSI subgroups. We further showed that anatomical location has a strong independent impact on the CRC methylome. Together, these findings refine the molecular characterisation of CRC and highlight potential epigenetic markers that could inform patient stratification and precision oncology.

13
Structured Error Analysis and Corrective Actions in Clinical Laboratory Practice: An Analysis of 7226 External Quality Assurance Participations

Strasser, B.; Mustafa, S.; Holly, M.; Grünberger, M.; Anita, S.

2026-04-04 health systems and quality improvement 10.64898/2026.04.02.26350023 medRxiv
Top 0.3%
1.3%
Show abstract

Background: External Quality Assurance (EQA) is an essential component of modern laboratory medicine. Current scientific evidence on EQA focuses primarily on the analyses carried out by EQA providers while relatively little research has been conducted in individual clinical laboratories. Methods: In this retrospective single-center observational study in a clinical laboratory, EQA results were analyzed over a period of four years (2021-2024). The evaluation was based on EQA action reports documented in the institutes internal quality management system. Deviations were classified according to department, type of discrepancy, root cause category (analytical, preanalytical, systemic, unidentifiable), and measures taken. Results: A total of 7226 EQA participations were evaluated during the observation period. The overall error rate remained consistently low, ranging between 0.8% and 1.6%, with no significant change over time (p = 0.87). Most deviations occurred in the departments of clinical chemistry and immuno/autoimmune diagnostics (p < 0.001). These were predominantly quantitative discrepancies (false low/false negative or false high/false positive). Root cause analysis showed a clear dominance of analytical causes (p < 0.001), while preanalytical and systemic causes were identified less frequently. In most cases, corrective measures, such as re-analyses, recalibrations, process adjustments, or staff training, were implemented promptly. Hard structural measures, such as changing methods or discontinuing tests, were rarely necessary. Conclusion: In a clinical laboratory, EQA is an important tool for structured error analysis and continuous quality improvement. Consistent processing of deviating EQA results goes hand in hand with stable analytical performance and a low error rate.

14
Inherited genetic risk factors in young-onset lung cancer

Esai Selvan, M.; Gould Rothberg, B. E.; Patel, A. A.; Sang, J.; Horowitz, A.; Christiani, D. C.; Klein, R. J.; Gumus, Z. H.

2026-04-15 genetic and genomic medicine 10.64898/2026.04.14.26350822 medRxiv
Top 0.3%
1.2%
Show abstract

Introduction Lung cancer is rare before age 45, and its inherited genetic basis remains poorly defined. Methods We performed whole-genome sequencing in 171 predominantly young-onset lung cancer patients and integrated these data with whole-exome sequencing from six major lung cancer consortia, yielding 9,065 patients. After quality control, analyses focused on 6,545 individuals of European ancestry, the largest ancestral group. We compared the prevalence of rare pathogenic and likely pathogenic (P/LP) germline variants between 186 young-onset (age <45 years) and 6,359 older patients at gene and gene-set levels using Fisher's exact test, stratified by histology, sex, and smoking status. Polygenic risk scores (PRS) derived from common variants were also evaluated. Results Young-onset patients carried a higher burden of rare germline P/LP variants in DNA damage response (DDR) genes (including BRIP1, ERCC6, MSH5), and in cilia-related genes, notably GPR161. At the pathway level, DDR genes were significantly enriched (OR=1.66, p=0.007), with the strongest signal in the Fanconi Anemia pathway and among females (OR=1.96, p=0.01). Enrichment was also observed in inborn errors of immunity pathways, with strongest signals in antibody deficiency and the complement system genes. Young-onset patients additionally exhibited higher lung cancer PRS. Conclusion Young-onset lung cancer exhibits a distinct germline genetic architecture, characterized by enrichment of rare P/LP variants in DDR, cilia-related, and immune pathways, and an elevated lung cancer PRS. These findings support a greater role for inherited susceptibility in early-onset disease and have implications for risk stratification, earlier screening, and precision prevention.

15
RNase HII-assisted amplification (RHAM) for rapid point-of-care monkeypox detection

Kamhieh-Milz, J.; Kamhieh-Milz, S.; Schwarz, F.; Michel, J.; Nitsche, A.; Puyskens, A.

2026-04-02 infectious diseases 10.64898/2026.04.01.26349928 medRxiv
Top 0.3%
1.2%
Show abstract

Mpox poses an ongoing global public health threat, with case numbers rising beyond traditionally endemic regions in Central and Western Africa. Rapid detection of the causative agent, the Monkeypox virus (MPXV), is critical for outbreak control, yet laboratory infrastructure and trained personnel remain scarce in many affected areas. Point-of-care molecular diagnostics offer a practical solution by enabling timely testing without specialized equipment or elaborate nucleic acid extraction. We evaluated the performance of an extraction-free RNase HII-assisted amplification (RHAM) assay for MPXV detection by Pluslife Biotech, a novel isothermal amplification technology providing results in under 30 minutes. The Pluslife RHAM test demonstrated pan-MPXV clade reactivity, detecting all four MPXV clades (Ia, Ib, IIa, IIb) with high analytical sensitivity and no cross-reactivity to other poxviruses or other clinically relevant pathogens. The assay proved compatible with diverse clinical specimen types, including lesion swabs, oropharyngeal swabs, rectal swabs, urine, semen, and wound exudate. As part of routine diagnostics at the German Consultant Laboratory for Poxviruses, in a comprehensive evaluation of 206 clinical specimens against diagnostic real-time PCR, the Pluslife RHAM test achieved a diagnostic sensitivity of 94.2% (95% CI: 85.8-98.4%) and a specificity of 100% (95% CI: 97.3-100%). Notably, samples with higher viral loads (Ct <30) showed 100% sensitivity. Time-to-result correlated significantly with viral load, enabling faster diagnosis in high-viral-load cases. The Pluslife RHAM test represents a practical, sensitive, and rapid point-of-care solution for MPXV detection in resource-limited settings, combining strong analytical performance with operational simplicity to support timely outbreak response and clinical decision-making.

16
Artificial Intelligence-Driven Identification of Age- and Treatment-Specific TP53 and PI3K Alterations in Pancreatic Ductal Adenocarcinoma

Diaz, F. C.; Waldrup, B.; Carranza, F. G.; Manjarrez, S.; Velazquez-Villarreal, E.

2026-04-11 gastroenterology 10.64898/2026.04.07.26350355 medRxiv
Top 0.3%
0.9%
Show abstract

BackgroundDespite extensive characterization of key oncogenic drivers, pancreatic ductal adenocarcinoma (PDAC) continues to exhibit profound molecular heterogeneity and inconsistent responses to standard therapies, including gemcitabine. The role of pathway-level alterations, particularly in the context of age at onset and therapeutic exposure, remains insufficiently defined. MethodsIn this study, we leveraged a conversational artificial intelligence framework (AI-HOPE-TP53 and AI-HOPE-PI3K) to enable precision oncology, driven interrogation of clinical and genomic data from 184 PDAC tumors, stratified by age at diagnosis and gemcitabine exposure. Using AI-enabled cohort construction and pathway-centric analyses, we evaluated alterations in TP53 and PI3K signaling networks, with findings validated through conventional statistical methods. ResultsTP53 pathway analysis revealed a significantly higher frequency of TP53 mutations in early-onset compared to late-onset PDAC among gemcitabine-treated patients (86.7% vs. 57.1%, p = 0.04), with a similar trend observed between treated and untreated early-onset cases (86.7% vs. 40%, p = 0.07). Notably, in late-onset PDAC patients not treated with gemcitabine, absence of TP53 pathway alterations was associated with improved overall survival (p = 0.011). Complementary analyses of the PI3K pathway demonstrated a higher prevalence of pathway alterations in late-onset gemcitabine-treated tumors compared to untreated counterparts (13.2% vs. 2.7%, p = 0.02). Importantly, among late-onset patients not receiving gemcitabine, those without PI3K pathway alterations exhibited significantly improved overall survival (p < 0.0001). ConclusionTogether, these findings identify distinct TP53 and PI3K pathway dependencies that are modulated by both age-of-onset and treatment exposure in PDAC. This work highlights the utility of conversational artificial intelligence in enabling rapid, integrative, and hypothesis-generating analyses within a precision oncology framework, supporting the identification of clinically relevant molecular stratification strategies for this aggressive disease.

17
Germline VCF Annotator: a lightweight pipeline for processing germline VCFs with robust variant extraction and read evidence quality control

Manojlovic, Z.

2026-04-09 bioinformatics 10.64898/2026.04.06.716730 medRxiv
Top 0.4%
0.9%
Show abstract

Raw variant calls are typically distributed as VCF files and are not well-suited for direct human review. They are intended for programmatic parsing, and spreadsheet import can distort data through automatic type conversion. Furthermore, variants in VCF are commonly annotated to add gene context and predicted functional consequences. Ensembl VEP, a widely used standard for transcript-aware variant annotation, was adapted in this study to generate standardized consequence fields across genomic features. Using a colon crypt whole-genome sequencing cohort as the motivating dataset, this study examined whether variation at DNA damage response and repair (DDR) loci could contribute to mutation-burden patterns in normal colon crypts, including patterns associated with age and potential treatment-related exposure. To make this question testable in a reproducible table-based format, the Germline VCF Annotator was developed as a two-step workflow that normalizes germline VCFs, generates VEP tabular annotations with explicit allele fields, and then extracts variants of interest and appends read-evidence metrics to assign a rules-based QC class. Within-patient concordance across technical repeats at predefined DDR loci was near-perfect after filtering for nonsilent SNVs with read depth [&ge;]15, with discordance concentrated among Low-QC loci. Bulk and crypt-derived samples showed no age-related trend in DDR burden. Although the demonstration centers on DDR and aging, the Germline VCF Annotator is applicable to other gene sets that require human-readable locus-level summaries with retained allele provenance and read evidence.

18
Genetic loss of JAK1 and cutaneous HPV infection

Fan, S.-Q.; Wang, R.-R.; Colombo, R.; Tang, K.-C.; Liu, J.-W.; Pontoglio, A.; Zhang, L.-L.; Li, K.; Han, S.-R.; Zhang, H.; Bai, X.; Yu, X.; Habulieti, X.; Liu, K.-Q.; Sun, Y.; Sun, L.-W.; Liu, H.; Sun, M.; Lin, Z.-M.; Zhang, F.-R.; Ma, D.-L.; Zhang, X.

2026-04-08 genetic and genomic medicine 10.64898/2026.04.03.26350014 medRxiv
Top 0.4%
0.9%
Show abstract

Background: Human papillomaviruses (HPVs) pose a severe threat to global public health by driving nonmelanoma skin cancer (NMSC) and cervical cancer, with NMSC being one of the most common cancers worldwide. Epidermodysplasia verruciformis (EV) is an inborn error of immunity characterized by an increased susceptibility to persistent infection of cutaneous HPV and a high risk of NMSC. The genetic basis remains unknown in many patients with EV. Methods: We collected four unrelated pedigrees with EV. Genetic analysis identified five variants in JAK1 encoding the Janus kinase 1. Ex vivo models and patient-derived tissue were employed to evaluate the functional effects of JAK1 variants and delineate the pathogenic mechanisms. Results: We identified different variants in JAK1 in four pedigrees with dominant EV. Genetic analysis revealed five novel variants in JAK1, three of which resulted in nonsense-mediated mRNA decay (NMD). Functional assays identified a decreased phosphorylation of the signal transducers and activators of transcription (STATs), impaired interferon responses, and defective T cell activation. Immune dysregulation in patients, characterized by a reduced CD4/CD8 T cell ratio, decreased CD8 naive T cell proportion, and accumulated memory T cells, implies impaired antiviral immunity against HPV. Conclusions: Our findings confirm that JAK1 loss-of-function (LOF) variants underlie susceptibility to cutaneous HPV infection. [Funded by the National Natural Science Foundation of China (81788101, 81230015, 82394420, and 82394423), the National Key Research and Development Program of China (2022YFC2703900), the CAMS Innovation Fund for Medical Sciences (2021-I2M-1-018), and the Regione Lombardia, Italy (Innovative Research Project 1137-2010)].

19
Introducing the digital PCR data essentials standard to harmonize data structure for clinical and research use

Trypsteen, W.; Vynck, M.; Untergrasser, A.; Whale, A. S.; Rodiger, S.; Dobnik, D.; Bogozalec Kosir, A.; Milavec, M.; Kubista, M.; Pfaffl, M. W.; Nour, A. A.; Young-Kyung, B.; Bustin, S. A.; Calin, G.; Chen, Y.; Cleveland, M. H.; De Falco, A.; Forootan, A.; O'Sullivan, D. M.; Devonshire, A. S.; Foy, C. A.; Fraley, S. I.; Gleerup, D. G.; He, H.-J.; Hellemans, J.; Lievens, A.; Lind, G. E.; Porco, D.; Romsos, E. L.; Thas, O.; Drandi, D.; de Tayrac, M.; Taly, V.; Huggett, J. F.; Vandesompele, J.; De Spiegelaere, W.

2026-04-13 bioinformatics 10.64898/2026.04.10.717368 medRxiv
Top 0.4%
0.9%
Show abstract

Digital PCR (dPCR) is a powerful technology for absolute quantification of nucleic acids, valued for its accuracy, sensitivity, and repeatability. Yet, the commercialization of different instruments with proprietary software has introduced challenges to data analysis, interoperability, and comparability. Therefore, we present the Digital PCR Data Essentials Standard (DDES) - a lightweight, human- and machine-readable, and cross-platform data standard developed in collaboration with the dPCR community. The standard consists of three file types designed to enable both manual inspection and automated analysis: (i) a main file summarizing experiment and reaction-level (meta-)data; (ii) an assay file describing targets and detection chemistry, and (iii) intensity files capturing partition-level raw fluorescence data per reaction. DDES supports a wide range of current dPCR applications, including singleplex and multiplex assays, endpoint and real-time readouts, and will be curated to implement future dPCR developments. By harmonizing the data structure, DDES lays out the foundation for FAIR dPCR data practices and supports improved software compatibility, collaborative and reproducible research, and future dPCR data repositories.

20
Barcode Crosstalk in ONT Multiplex Sequencing: Quantification and Mitigation Strategies

Scharf, S. A.; Spohr, P.; Ried, M. J.; Haas, R.; Klau, G. W.; Henrich, B.; Pfeffer, K.

2026-03-28 molecular biology 10.64898/2026.03.27.714689 medRxiv
Top 0.4%
0.9%
Show abstract

Multiplexing samples in long-read sequencing with Oxford Nanopore Next Generation Sequencing Technology (ONT) by ligating specific native barcodes to individual DNA samples enables significant increases of high throughput sequencing combined with a significant reduction of sequencing costs. However, this advantage carries the risk of barcode misassignment / crosstalk. Employing ONT multiplex sequencing with samples, we observed misassigned barcodes so called barcode crosstalk, after ONT library preparation according to the standard protocol, particularly in samples with low input DNA concentrations. We assumed that these barcode misassignments are largely due to misligation of remaining native barcodes during subsequent the subsequent sequencing adapter ligation. To systematically investigate and quantify barcode crosstalk, genomic DNA (gDNA) from four bacterial type strains with different DNA input concentrations was prepared using three protocols for library preparation: the Nanopore standard protocol (protocol A: version valid until July 2, 2025) the new Nanopore protocol (protocol B: version from July 2, 2025), and an in house protocol with pooling of the barcoded samples only after the sequencing adapter ligation step (protocol C: in house). All samples were sequenced on a Nanopore PromethIon device. The results clearly showed that the use of protocol A resulted in a pronounced barcode crosstalk especially detectable in samples with low DNA input concentrations (up to 2.4% misassigned reads). The ONT adjustment in protocol B (altered washing buffer vs. protocol A) significantly alleviated the barcode crosstalk to below 0.01%, whereas protocol C eliminated barcode crosstalk virtually completely. These observations emphasize that sequencing results obtained with older ONT native barcoding protocol variants should be critically reviewed. The newer ONT barcoding protocol is preferable for sequencing, but it does not completely eliminate the barcode crosstalk effect. In conclusion, for low DNA input and high accuracy sequencing, protocol C is recommended.